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(54) Digital ink annotation process and system for recognizing, anchoring and reflowing digital 
ink annotations 



(57) A digital ink annotation process and system for 
processing digital documents and digital ink annotations 
therein. The process and system maintain an annota- 
tion's position within a document such that the original 
intent and meaning of the annotation is preserved. This 
is true even if the document is edited, resized, displayed 
on a different device or otherwise modified. The digital 
ink annotation process includes automatic and manual 
grouping of digital ink strokes within a document to de- 
fine digital ink annotations, classifying the annotations 



according to annotation type, and anchoring the anno- 
tations to appropriate regions or positions in a docu- 
ment. The process further includes reflowing the anno- 
tations in a new document layout such that the annota- 
tions conform and adapt to the new layout while preserv- 
ing the original intents and meanings of the annotations. 
A digital ink annotation system includes a classification 
module, an anchoring module, a reflow module and a 
clean-up module to implement the digital ink annotation 
process. 
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Description 
TECHNICAL FIELD 

[0001] The present invention relates in general to dig- 
ital document processing and more particularly to a 
process and system for processing freeform digital ink 
annotations made on a digital document such that when 
the digital document's layout is modified (either because 
it is edited or displayed on different displays), the anno- 
tations' layout adapts accordingly. 

BACKGROUND OF THE INVENTION 

[0002] Annotating paper documents with a pen is a 
familiar and indispensable activity across a wide variety 
of business and educational environments. Annotating 
a document is the act of "marking up" or placing critical 
and explanatory notes and remarks on the document. 
These notes and remarks may by textual, graphical, or 
both. 

[0003] As pen-based computing devices such as pen 
computers, Tablet PCs and personal digital assistants 
(PDAs) become increasingly popular among consum- 
ers, the ability to annotate digital documents becomes 
highly useful and important. Pen-based computing de- 
vices utilize an electronic pen (called a stylus) instead 
of a keyboard for input. Pens are used for input because 
in many situations the computing devices are too small 
to incorporate a keyboard. In addition, there are numer- 
ous situations where a pen together with a notepad is 
more convenient for the user that a keyboard. These 
pen-based computing devices generally have special 
operating systems that support handwriting recognition, 
which allows a user to interface with the device by writ- 
ing on a screen or on a tablet instead of typing on a key- 
board. 

[0004] Despite the availability and usefulness of pen- 
based computing devices, when it comes to reading and 
annotating documents the majority of people still prefer 
pen and paper. One key reason is that pen and paper 
offer a reader an easy way for the reader to sketch un- 
structured or freeform notes and drawings in response 
to document content. 

[0005] Notwithstanding the advantages offered by pa- 
per documents during the annotation process, digital 
documents and annotations tend to be more flexible 
than their paper counterparts. For example, digital doc- 
uments can be more easily edited and adapted to con- 
form to different display sizes than paper documents. In 
addition, while annotated paper documents often end 
up in the trash bin, digital annotations can persist 
throughout the lifetime of a digital document. Moreover, 
digital documents and annotations can be more easily 
filtered, organized, and shared. 

[0006] Because of this flexibility, digital documents 
can be used in diverse environments. For example, a 
single digital document may be read is several different 
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formats, displayed on a variety of devices, and present- 
ed within a variety of window sizes. The digital document 
may be edited, combined with other documents, and 
may even dynamically adapt its contents. Thus, unlike 
5 a paper document, a digital document generally does 
not have a permanent layout. 

[0007] This lack of a permanent layout presents a sig- 
nificant technological challenge when considering free- 
form digital ink annotations on a digital document. Each 

10 time a digital document adapts to a new layout, for in- 
stance, the associated annotations must also be adapt- 
ed. In other words, when a digital document is edited or 
displayed on different devices or in different window siz- 
es, the layout of the digital document changes to adapt 

is to the new situation. Any "digital ink" annotations made 
on the digital document must likewise be adapted (or 
"reflowed") to the new layout conditions. 
[0008] In order to support reflowing freeform digital ink 
annotations, three problems must be solved: First, the 

20 annotations that the reader is creating must be recog- 
nized and classified as one of several types of annota- 
tions (e.g. "underline," "circle," "margin comment," etc.). 
Next, each annotation must be anchored to the particu- 
lar place in the document where the annotation belongs. 

25 Finally, if the underlying document's layout changes at 
some point in the future, existing annotations on the doc- 
ument must be properly reflowed so they continue to 
agree with the reader's original intent. 
[0009] Historically, much of the research and devel- 

30 opment effort associated with digital ink has centered 
on handwriting recognition. The digital ink annotation 
process and system disclosed herein, however, does 
not use handwriting recognition. This is because it is 
possible to reflow digital ink annotations simply by know- 

35 ing that they are or are not handwriting (which is a high- 
level classification task), without knowing specifically 
what they say (which is a fine-grained recognition task). 
[0010] More recently, several approaches have used 
ink shape recognition to support a variety of sketch- 

40 based interfaces. These approaches have used heuris- 
tics or machine learning techniques to recognize a set 
of shapes or gestures. However, when considering their 
use for support of digital ink annotation, one shortcom- 
ing with these approaches is that they do not anchor ink 

45 strokes with an independent context such as an under- 
lying document, and thus cannot modify the user's ink 
in response to changes in this context. 
[0011] There is at least one approach that applies 
shape recognition to digital ink annotations, then use 

so this to adapt freeform ink annotations to constrained 
changes in viewing conditions (such as changing the 
font size in a document). This approach originally tried 
simple heuristics to perform recognition, however this 
was found to be insufficient. Subsequently, this ap- 

55 proach relied on machine learning techniques for rec- 
ognition, and "super-local" anchoring for associating 
parts of inkstrokes with individual context features. One 
problem with this approach has been that it does not 
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include manual classification. Manual classification 
should be included in a system such that the user is al- 
lowed to make manual selections when the automatic 
classification, anchoring, or reflow fails or is not feasible. 
Also, "super-local" anchoring (e.g. where parts of indi- 
vidual ink strokes are separately anchored to document 
context features) is appropriate for adapting annotations 
to the kind of constrained changes in viewing conditions 
that this approach was concerned with, however, it is 
insufficient for support of fully unconstrained reflow of 
digital ink annotations, since the transformation of an 
annotation must take into account all ink strokes in- 
volved in the annotation and all features in an annota- 
tion's anchor context. 

[0012] Therefore, there exists a need for a document 
processing system and method that overcomes the 
aforementioned problems and limitations. In particular, 
a system and method are needed that preserve the in- 
tent and visual meaning of digital ink annotations when- 
ever an original document is modified. 

SUMMARY OF THE INVENTION 

[0013] The invention disclosed herein includes a dig- 
ital inkannotation process and system for preserving the 
intent and meaning of digital ink annotations in an orig- 
inal document whenever the original document takes on 
a new layout as a result of being edited or displayed on 
a different display device or in a different window size. 
In general, the process and system include three broad 
aspects. First, as a user in marking up a document, their 
digital ink strokes are grouped to define an annotation 
and the annotation is classified as one of several anno- 
tation "types." These annotation "types" include under- 
lines, highlights, margin brackets, circles, marginalia, 
and callout or connector annotations. Second, each an- 
notation must be anchored to its surrounding context in 
the document. Finally, when the layout of the underlying 
or original document changes, each annotation must be 
transformed to agree with and conform to the new layout 
of its context. This final step is called "reflowing" the doc- 
ument and annotations. These three aspects allow the 
digital ink annotation system and process disclosed 
herein to process digital ink annotations on a digital doc- 
ument such that the annotations "keep up" with where 
they belong in the document. This is true even if the doc- 
ument is edited, resized, displayed on a different device 
or otherwise modified. 

[0014] In general, the digital inkannotation system in- 
cludes several components to process the digital docu- 
ment and annotations while ensuring the correct posi- 
tioning of the annotations in a reflowed digital document. 
The system includes a classification module, an anchor- 
ing module, a reflow module, and a clean-up module. 
The classification module groups digital ink strokes to 
define an annotation and then classifies that annotation. 
The anchoring module abstracts a 2D layout model of 
the document and uses robust anchoring techniques to 
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connect (or "anchor") the annotation to a region in the 
digital document that the annotation is associated with. 
The reflow module uses the classification and anchor 
context information associated with each annotation to 
5 reflow or rerender the annotation based on changes to 
the digital document. The reflow module insures pres- 
ervation of the user's original intent and meaning when- 
ever the digital document is modified. The clean-up 
module is an optional component that can be used to 
to eliminate the user's original freehand annotations and 
redraw formalized "cleaned-up" versions of the annota- 
tions. The clean-up module stylizes each annotation by 
initially determining its classification. Then, based on 
classification clean-up rules, the freehand annotation is 
is converted to a stylized annotation. 

[0015] The classification module encapsulates the 
process by which proximate ink strokes are grouped and 
classified on a digital document to define a digital ink 
annotation. It does this by extracting and examining fea- 
20 tures of the digital ink strokes to determine an initial 
grouping and classification, then refining this grouping 
and classification based on an hypothetical anchor con- 
text forthe annotation in the digital document. Based on 
the refined grouping and classification result, a better 
25 estimate for the anchor context is identified, and so on. 
After iterating back and forth between grouping and 
classification, and anchor context identification, the sys- 
tem converges to a grouping and classification result 
with associated confidence and anchor context. If the 
30 grouping and classification process converges to a low- 
confidence estimate for a particular annotation's classi- 
fication and/or anchor context (e.g. below a certain 
threshold), then it is determined that manual grouping 
should be used, and the user is consulted. 
35 [0016] Grouping of ink strokes may be based on a 
temporal order of strokes, spatial arrangement, or a 
combination of both. The digital ink annotation process 
automatically recognizes at least six types or classifica- 
tions of digital ink annotations: underline annotations, 
to highlight annotations, marginalia annotations, circle an- 
notations, margin bracket annotations, and callout or 
connector annotations Each of these classifications has 
distinct characteristics which determine what type of an- 
chor context to expect, and what kind of reflow transfor- 
ms mation will be performed. 

[0017] Once the ink strokes of the annotation are 
grouped, the annotation is classified, and a likely anchor 
context has been identified, the anchoring module an- 
chors the annotation to its intended context. In other 
so words, the annotation must be logically anchored to a 
region or position in the document where it belongs such 
that that same region or position in the document can 
be recovered even if the document's layout, format, or 
content changes. The anchoring process defines an- 
55 choring rules for each classification of digital ink anno- 
tation, applies these anchoring rules to each digital ink 
annotation, and generates a logical anchor for each an- 
notation. In addition, the anchoring process determines 
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the validity of the anchor based on the context of the 
annotation in the document, and ascertains whether us- 
er interaction is desired based on the validity of anchor. 
For instance, if the context is determined to be missing 
as the result of an edit, the annotation's anchor is deter- 
mined to be invalid, and the user is consulted as to what 
to do with the orphaned annotation. 
[0018] If the layout of the document to which the an- 
notation is anchored subsequently changes, but the an- 
notation's anchor is not invalidated by the change, the 
reflow module rerenders the annotation based on its 
classification and its anchor context. The reflow module 
determines the new layout of the digital document, then 
transforms and rerenders the annotation to conform to 
the new layout without losing any meaning or intent of 
the annotation. 

[0019] In addition, the digital ink annotation system in- 
cludes a user feedback module. The user feedback 
module provides an interface whereby the user can in- 
teract with the system to improve the accuracy and ef- 
ficacy of the system. The feedback user interface is in 
communication with the classification module, the an- 
choring module, the reflow module, and the clean-up 
module. A user may also override the automatic fea- 
tures of the system at any time and manually perform a 
task (such as grouping or classification). 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0020] The present invention can be further under- 
stood by reference to the following description and at- 
tached drawings that illustrate aspects of the invention. 
Other features and advantages will be apparent from the 
subsequent detailed description of the invention, taken 
in conjunction with the accompanying drawings, which 
illustrate, by way of example, the principles of the 
present invention. 

[0021] Referring now to the drawings in which like ref- 
erence numbers represent corresponding parts 
throughout: 

FIG. 1 is a block diagram illustrating an annotation 
environment incorporating the digital ink annotation 
system and process disclosed herein. 
FIG. 2A is a block diagram illustrating the details of 
the an exemplary implementation of the digital ink 
annotation system shown in FIG. 1 during annota- 
tion grouping, classification and anchoring. 
FIG. 2B is a block diagram illustrating the details of 
the an exemplary implementation of the digital ink 
annotation system 100 shown in FIG. 1 during re- 
flowing of annotations. 

FIG. 3 is a block diagram illustrating the details of 
an exemplary implementation of the classification 
module shown in FIG. 2A. 

FIG. 4 is a block diagram illustrating the details of 
an exemplary implementation of the anchoring 
module shown in FIG. 2A. 
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FIG. 5 is a block diagram illustrating the details of 
an exemplary implementation of the reflow module 
shown in FIG. 2B. 

FIG. 6 is a general flow diagram illustrating the op- 
5 eration of the digital ink annotation process of the 

digital ink annotation system shown in FIGS 1, 2A 
and 2B. 

FIG. 7 is a detailed flow diagram illustrating the op- 
eration of the classification process for the digital 
io ink annotation process shown in FIG. 6. 

FIG. 8A illustrates a user annotating a digital docu- 
ment by creating digital ink strokes on the docu- 
ment. 

FIG. 8B illustrates the manual grouping and classi- 
cs fication of the annotation. 

FIG. 8C illustrates the reflowing of the document 
and annotation using the digital ink annotation proc- 
ess. 

FIG. 9 is a detailed flow diagram illustrating the op- 
20 eration of the anchoring classification process for 
the digital ink annotation process shown in FIG. 6. 
FIG. 1 0 is a detailed flow diagram illustrating the op- 
eration of the reflow classification process for the 
digital ink annotation process shown in FIG. 6. 
25 FIG. 11 A illustrates the digital ink annotation 
processing of an underline annotation. 
FIG. 11 B illustrates the digital ink annotation 
processing of a highlight annotation. 
FIG. 11 C illustrates the digital ink annotation 
30 processing of a marginalia annotation. 

FIG. 11 D illustrates the digital ink annotation 
processing of an inline circle annotation. 
FIG. 11 E illustrates the digital ink annotation 
processing of a region circle annotation. 
35 FIG. 11 F illustrates the digital ink annotation 
processing of a margin bracket annotation. 
FIG. 11 G illustrates the digital ink annotation 
processing of a connector annotation. 
FIG. 1 2 illustrates the toolbar of a working example 
to of the digital ink annotation process and system. 

FIG. 1 3 illustrates an example of a suitable comput- 
ing system environment in which the digital ink an- 
notation process and system shown in FIG. 1 may 
be implemented. 

45 

DETAILED DESCRIPTION OF THE INVENTION 

[0022] In the following description of the invention, 
reference is made to the accompanying drawings, which 

so form a part thereof, and in which is shown by way of 
illustration a specific example whereby the invention 
may be practiced. It is to be understood that other em- 
bodiments may be utilized and structural changes may 
be made without departing from the scope of the present 

55 invention. 
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I. Introduction 

[0023] People have been making pen and ink anno- 
tations on paper documents for literally hundreds of 
years. In general, an annotation is a note (either textual 
or graphical) or a symbol added to a document for com- 
ment, explanation or correction. Annotations are a nat- 
ural and intuitive means of recording one's thoughts and 
impressions about a document. In addition, annotations 
can be used to indicate errors in a document that need 
correction. 

[0024] Current technology allows a user to create dig- 
ital ink an notations on a "frozen" digital document similar 
to pen annotations on a paper document. For example, 
a Table PC user can make digital ink annotations on an 
image of a document (such as a Tagged Image File For- 
mat (TIFF) image of a document) using a stylus to write 
on a screen. The user has all the flexibility afforded him 
on paper, however he also suffers all the limitations. 
True "dynamic" digital documents - such as word 
processing documents or HTML web pages - are much 
more flexible and fluid than their paper document (or 
document image) counterparts. For example, dynamic 
digital documents can be easily resized within a window 
or reformatted for display on a larger or smaller display 
device. There are currently no techniques that support 
the adaptation of annotations on these more flexible dy- 
namic digital documents when the documents' layouts 
change. 

[0025] The digital ink annotation system and process 
disclosed herein supports digital ink annotations on a 
dynamic digital document such that annotations "keep 
up" with where they belong in the document when and 
if the document's layout changes. This is true even if the 
document is edited, resized, displayed on a different de- 
vice, or otherwise modified. By way of example, sup- 
pose that a user underlines a single-line sentence on a 
web page using a Tablet PC. If the web page is subse- 
quently reformatted for display on the smaller screen of 
a PDA, the underline may have to be split across multi- 
ple document lines. This insures that all parts of the sen- 
tence remain underlined and the user's original intent is 
preserved. 

II. General Overview 

[0026] The digital ink annotation system and process 
represents an important extension of the traditional pen 
and paper annotation paradigm, from static documents 
to dynamic documents. There are at least three primary 
aspects to the digital ink annotation system and proc- 
ess. A first aspect is that digital ink annotations are au- 
tomatically or manually recognized and classified. A 
second aspect is that the classified annotations then are 
anchored to elements (text, images, tables, etc) within 
the document. This insures that even if the document is 
modified, the annotations on it will be placed in the doc- 
ument's new layout such that the user's original intent 



and meaning is preserved. A third aspect is a technique 
for modifying ink strokes such that the digital ink anno- 
tations are appropriately displayed after a document has 
been reflowed, modified, or both. 

5 [0027] FIG. 1 is a block diagram illustrating an anno- 
tation environment incorporating the digital ink annota- 
tion system and process disclosed herein. In general, 
the digital ink annotation system 1 00 is implemented in 
the annotation environment 110 such that a user 120 

to can produce a digital document with properly flowed an- 
notations 130. The digital ink annotation system 100 
generally resides on a computing device 140. As dis- 
cussed in section VII below, this computing device 140 
typically contains a processor (not shown) for executing 

'5 the digital ink annotation process in the form of compu- 
ter-executable instructions. 

[0028] In particular, referring to FIG. 1, the user 120 
interacts with the computing device 1 40 by way of a user 
interface 150. The user interface 150 allows the user 
20 120 to view and interact with a digital document 160. In 
addition, the user 120 is able to add freeform digital ink 
user annotations 1 70 to the digital document 1 60 using 
a stylus or pen input device. The digital document 160 
and associated user annotations 1 70 are sent to the dig- 
25 jtal ink annotation system 100 for processing. As ex- 
plained in detail below, the digital ink annotation system 
1 00 processes the digital document 1 60 and user anno- 
tations 1 70 and outputs the digital document with prop- 
erly flowed annotations 130. Thereafter, the digital doc- 
30 ument with properly flowed annotations 130 is treated 
as if it were the digital document 1 60: The user may add 
further annotations (or otherwise interact with the doc- 
ument, such as deleting or moving existing annotations, 
editing the document contents, etc), and these com- 
as mands are referred to the digital ink annotation system 
100 for subsequent processing. In addition, user 120 
can also interact directly with the digital ink annotation 
system 100 through the user interface 150, for instance 
to control annotation system parameters and preferenc- 
w es. 

III. System Details 

[0029] The digital ink annotation system 1 00 includes 
45 several components. These components allow the sys- 
tem 100 to process the digital document 160 and the 
user annotations 1 70 and insure the correct positioning 
of the user annotations 170 in a reflowed digital docu- 
ment. 

so [0030] FIGS. 2A and 2B are block diagrams illustrat- 
ing the details of an exemplary implementation of the 
digital ink annotation system 100 shown in FIG. 1. In 
general, as the user is adding new annotations to the 
document, the digital ink annotation system 100 proc- 

55 esses the digital document 1 60 and associated user an- 
notations 170 to produce the digital document with prop- 
erly flowed annotations 1 30. The digital document with 
properly flowed annotations 130 may subsequently be 
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modified outside the scope of the digital ink annotation 
system 100 by some external modification process (for 
instance, the document may be edited, or it may be dis- 
played in a different window size), which results in a 
modified annotated digital document 135. In this case, 
the modified annotated digital document 135 is fed as 
input back into the digital ink annotation system 100 so 
that its annotations may be properly reflowed to keep 
up with where they belong in the document's new layout, 
and a new digital document with properly flowed anno- 
tations 130 is produced. 

[0031] FIG. 2A is a block diagram illustrating the de- 
tails of an exemplary implementation of the digital ink 
annotation system 100 shown in FIG. 1 during annota- 
tion grouping, classification and anchoring. In particular, 
the digital ink annotation system 100 shown in FIG. 2A 
includes a classification module 200 and an anchoring 
module 21 0, The classification module 200 takes as in- 
put the digital ink strokes comprising the user annota- 
tions 170, along with the digital document 160, and 
groups the ink strokes together to define grouping, clas- 
sification, and anchor context combinations 230 for 
each annotation. 

[0032] The anchoring module 210 uses robust an- 
choring techniques to connect (or "anchor") the ink 
stroke groupings emitted by the classification module 
200 to their associated anchor context (e.g. to the region 
in the digital document that the group most likely is as- 
sociated with). The anchoring module 21 0 takes as input 
the grouping, classification, and anchor context combi- 
nations 230 emitted by the classification module 200, 
and the digital document 160. For each grouping, the 
anchoring module 21 0 extracts anchor features from the 
digital document 1 60 that describe the anchor context, 
and then associates these features with the grouping. 
After it has processed all groupings, the anchoring mod- 
ule 210 outputs the digital document with properly 
flowed annotations 130. 

[0033] As shown in FIG. 2B, the digital document with 
properly flowed annotations 130 may subsequently be 
modified by some external modification process 240 
outside the scope of the digital ink annotation system 
100 to produce a modified annotated digital document 
1 35. The external modification process 240 may include 
any changes - such as edits to the document contents, 
or display on a different device or in a different window 
size - which produce changes in the document's 2D lay- 
out, but which neglect to appropriately modify the anno- 
tations. As a result, the annotations may appear to be 
incorrect, and thus must be transformed to agree with 
the document's new layout in order to appear to "keep 
up" with where they belong. 

[0034] FIG. 2B is a block diagram illustrating the de- 
tails of an exemplary implementation of the digital ink 
annotation system 100 shown in FIG. 1 during reflowing 
of annotations. The digital ink annotation system 100 in- 
cludes a reflow module 250 and a clean-up module 260. 
The reflow module 250 takes as input the modified an- 



10 

notated digital document 135 and produces as output a 
new digital document with properly flowed annotations 
270. That is, the reflow module 250 transforms and 
rerenders the annotations that are present on the mod- 

5 ified annotated digital document 135 to adapt to the doc- 
ument's new layout. It does this by examining each in- 
dividual annotation's logical anchor features to discover 
where its anchor context has moved to in the docu- 
ment's new layout. Each different type or class of anno- 

10 tation (underline, circle, margin comment, etc) has its 
own set of rules governing how that type of annotation 
is to be reflowed. Once the reflow module 250 deter- 
mines where the annotation's anchor context has gone, 
it chooses the appropriate set of reflow rules to use for 

'5 the annotation's type or classification, and transforms 
the annotation. This manner of reflow insures that the 
user's original intent and meaning is preserved when- 
ever the digital document with properly flowed annota- 
tions 130 changes. 

20 [0035] The clean-up module 260 is an optional com- 
ponent (as indicated by the dashed box) that replaces 
the user's original freehand ink annotations with formal- 
ized "cleaned-up" versions of the annotations. As ex- 
plained below, some annotations are better left pre- 

25 served as the user drew them, so they pass through the 
clean-up module260 without being replaced. The clean- 
up module 260 stylizes each annotation by initially de- 
termining its type or classification. Then, based on the 
set of clean-up rules for the annotation's particular type, 

30 the annotation is converted to a stylized form. 

[0036] Referring to FIGS. 2A and 2B, the digital ink 
annotation system 100 also includes a user feedback 
module 280 that provides an interface whereby the user 
can interact with the digital ink annotation system 100. 

35 This user feedback insures the accuracy and efficacy of 
the system 100. The feedback user interface 280 can 
control aspects of the classification module 200, the an- 
choring module 210, the reflow module 250 and the 
clean-up module 260, and thus allows the user to inter- 

40 act with and override the automatic functionality of these 
modules. The dashed lines connecting the feedback us- 
er interface 280 to these four modules indicates that us- 
er feedback is optional in each case. 
[0037] By way of example, the classification module 

45 200 may display an annotation's grouping, classifica- 
tion, and anchor context via the feedback user interface 
280. The user can inspect this information and may 
choose to modify it, for instance by changing the group- 
ing of ink strokes that compose the annotation, or 

so changing the annotation's classification, or changing the 
region in the document with which the annotation is as- 
sociated. The user may use the feedback user interface 
280 to modify how the anchoring module 21 0 generates 
logical anchors for annotations, for instance by chang- 

55 ing the type of features that are extracted from the digital 
document 160 to describe anchor contexts. The user 
may use the feedback user interface 280 to manually 
specify which set of rules the reflow module 250 should 
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use when reflowing a particular annotation, for instance 
if the default set of reflow rules for the annotation's type 
are incorrect considering the annotation's particular 
characteristics. Finally, the user may use the feedback 
user interface 280 to manually specify which set of rules 
the clean-up module 260 should use when displaying a 
formalized or "cleaned-up" version of an annotation, for 
instance if the default set of "clean-up" rules for the an- 
notation's type is incorrect given the annotation's partic- 
ular characteristics. Through the feedback user inter- 
face 280, a user is capable of interacting with the system 
100 to further enhance its accuracy, reliability and effec- 
tiveness. 

[0038] FIG. 3 is a block diagram illustrating the details 
of an exemplary implementation of the classification 
module 200 shown in FIG. 2A. Generally, the classifica- 
tion module 200 uses features of the user annotations 
170 and contextual information from the digital docu- 
ment 160 to group and classify each annotation and to 
determine an appropriate anchor context, and outputs 
grouping, classification, and anchor context combina- 
tions 230 for each annotation. The classification module 
200 includes an ink stroke feature extraction module 
300; a grouping and classification module 31 0 having a 
temporal pruning module 320 and a spatial pruning 
module 330; an anchor context identification module 
340; and a manual grouping, classification, and anchor 
identification module 350. 

[0039] The ink stroke feature extraction module 300 
measures the characteristics of the ink strokes in the 
user annotations 1 70. For a particular ink stroke these 
characteristics may include, for example, the angle of 
orientation of the stroke, the length of the stroke, the 
time at which the stroke was made relative to all other 
ink strokes, and the position of the stroke with respect 
to all the other ink strokes. These characteristics are fed 
into the grouping and classification module 310 to de- 
termine the correct grouping of ink strokes together. 
[0040] The grouping and classification module 31 0 is 
responsible for grouping the ink strokes in user annota- 
tions 170 into distinct groups with appropriate class la- 
bels and anchor contexts in a globally optimal way. In 
general, the grouping and classification module 31 0 us- 
es dynamic programming to evaluate all possible group- 
ings and classifications of the inkstrokes comprising the 
user annotations 1 70, and to determine the optimal - or 
most likely correct - grouping and classification of all 
strokes into distinct, all-inclusive, non-overlapping 
groups. Evaluating all possible groupings and classifi- 
cations is combinatorially expensive in terms of compu- 
ter processing time, so the dynamic programming 
search space is pruned upfront based on the temporal 
ordering of strokes (using the temporal pruning module 
320), or the spatial arrangement of strokes (using the 
spatial pruning module 330), or both. In other words, on- 
ly groups of strokes that are proximate to one another 
in time and/or space are considered as part of the group- 
ing and classification module's 310 dynamic program. 
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The justification for this upfront pruning is that ink 
strokes that are distant from one another in time or 
space are unlikely to belong together in a group. 
[0041] Each of the remaining possible groups of 
5 strokes that is evaluated by the grouping and classifica- 
tion module 31 0 is assigned a probability based on how 
likely the strokes in the g roup are to form one of the types 
of annotation the system recognizes (e.g. underline, 
highlight, circle, margin comment, margin bar, or con- 
10 nector), and then the group is passed on to the anchor 
context identification module 340. The anchor context 
identification module 340 then attempts to identify the 
most likely anchor context for the group. For example, 
if the inkstrokes in a particular group are horizontal lines 
is and the grouping and classification module 31 0 has de- 
termined that the group probably represents an under- 
line annotation, the anchor context identification module 
440 will look for words in the document that lie directly 
above the lines. If the ink strokes in another group are 
20 determined to most likely form a circle annotation, the 
anchor context identification module 340 will look for the 
words that are enclosed by the circle. The anchor con- 
text identification module makes its best guess as to the 
anchor context for the group of ink strokes it has been 
25 presented with, assigns a probability that the anchor 
context is correct, and passes this information back to 
the grouping and classification module 310. 
[0042] The grouping and classification module 31 0 re- 
ceives the anchor context identification information for 
30 the group, combines it with the original classification 
probability that it calculated for the group, and places 
the combined value in its dynamic programming sub- 
problem solution table. If a particular grouping of ink 
strokes has an unlikely classification and/or an unlikely 
35 anchor context (for instance, there are no words above 
the lines in a group of ink strokes that were initially clas- 
sified as an underline), this will reduce the likelihood that 
this particular group is included in the final total grouping 
of all strokes in user annotations 1 70 determined by the 
40 grouping and classification module 310. When the 
grouping and classification module 310 has evaluated 
all possible groupings (that were not pruned upfront), it 
walks through its subproblem solution table and choos- 
es the set of grouping/classification/anchor context 
45 combinations that account for all strokes in user anno- 
tations 170, do not overlap, and maximize the combined 
grouping probability. This complete optimal grouping is 
then output as the grouping, classification, and anchor 
context combinations 230. 
so [0043] The manual grouping, classification, and an- 
chor identification module 350 provides a user interface 
by which the user can manually select a set of ink 
strokes, classify it according to the list of recognized an- 
notation types, and specify its anchor context. If the user 
55 elects to use this interface, either to circumvent the au- 
tomatic grouping process or to correct a mistake in the 
automatic process, whatever is manually chosen is re- 
garded as better than the automatic process can 
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achieve, and the ink strokes in the grouping are subse- 
quently ignored by the automatic process. 
[0044] FIG. 4 is a block diagram illustrating the details 
of an exemplary implementation of the anchoring mod- 
ule 210 shown in FIG. 2A. In general, the anchoring 
module 210 uses the grouping, classification, and an- 
chor context combinations 230 output by the classifica- 
tion module 200 to generate logical anchors that asso- 
ciate each group of strokes with its intended anchor re- 
gion in the document, in such a way that the anchor re- 
gion for each group can be recovered even if the layout 
and/or content of the document is changed. The anchor- 
ing module 210 includes annotation anchoring rules 
400, the context feature extraction module 41 0, and the 
logical anchor generator module 420. 
[0045] The anchoring module 21 0 takes as input the 
grouping, classification, and anchor context combina- 
tions 230 output by the classification module 200 and 
the digital document 160. The annotation anchoring 
rules 400 dictate how each distinct annotation type is 
anchored. For example, an underline annotation is an- 
chored to the text that it underlines, while a circle is an- 
chored to the text that it encloses. The context feature 
extraction module 410 extracts salient anchoring fea- 
tures from the digital document 160 and makes them 
available to the logical anchor generator module 420. 
For each ink stroke grouping, the logical anchor gener- 
ator module 420 chooses the appropriate anchor fea- 
tures, from among those extracted, based on the group- 
ing's classification and the anchoring rules for that type 
of annotation. The anchor features are assigned to the 
grouping as the grouping's logical anchor. The grouping, 
along with its classification, its anchor context, and its 
logical anchor, is then considered a fully reflowable an- 
notation. After the logical anchor generator module 420 
has completed its task for all grouping, classification, 
and anchor context combinations 230, the fully reflow- 
able annotations are ready for display on the digital doc- 
ument 1 60, and a digital document with properly flowed 
annotations 130 is produced. 

[0046] FIG. 5 is a block diagram illustrating the details 
of an exemplary implementation of the reflow module 
250 shown in FIG. 2B. In general, the reflow module 250 
transforms and rerenders each fully reflowable annota- 
tion on a modified annotated digital document 135 
based on the annotation's logical anchor and its classi- 
fication. The reflow module 250 includes a layout deter- 
mination module 500, an annotation transformation 
module 51 0, and a rerendering module 520, and it out- 
puts a new digital document with properly flowed anno- 
tations 270. 

[0047] The reflow module 250 takes as its input a 
modified annotated digital document 1 35 which has had 
its layout modified since the last time the annotations on 
the document were reflowed. For each annotation on 
the document, the layout determination module 500 re- 
covers the annotation's anchor context using the anno- 
tation's logical anchor, and determines the new layout 
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of the annotation's anchor context. Once the new layout 
is determined, the annotation transformation module 
51 0 is used to transform the annotations such that they 
conform to the new document layout (e.g. they "keep 
5 up" with the position in the document where they were 
originally intended to go). Conformation processing is 
different for each distinct annotation type. For underline 
annotations, for instance, it includes join operations to 
join two or more underline ink strokes that fall on the 
to same line of text, and split operations to split an under- 
line ink stroke that spans two or more lines of text in the 
new layout. For circle annotations, it includes affine 
transformations (translations, rotations, and scalings) 
so they agree with the new layout of their anchor con- 
's texts. 

IV. Operational Overview 

[0048] The digital ink annotation system 100 dis- 
20 closed herein uses a digital ink annotation process to 
process the annotations such that the annotations are 
dynamically adaptable. This means that when a docu- 
ment containing the annotations is modified the anno- 
tations can adapt to a new layout of the document. This 
25 dynamic adaptability insures that the original intent and 
meaning of each annotation is preserved. 
[0049] The digital ink annotation process includes 
three main features. First, the ink strokes of a digital ink 
annotation are grouped and classified, and a likely an- 
30 chor context is identified. Second, a logical anchor is 
generated for the classified annotation which identifies 
the region in the document with which the annotation is 
associated in such a way that the region can be recov- 
ered if the document's layout and/or content change. 
35 Anchoring the annotation insures that the annotation will 
be placed in the correct region of the document even 
after the document is modified. Third, the processed dig- 
ital ink annotations are reflowed or rerendered if the doc- 
ument's layout changes. 
40 [0050] FIG. 6 is a general flow diagram illustrating the 
operation of the digital ink annotation process of the dig- 
ital ink annotation system 1 00 shown in FIGS 1 , 2A and 
2B. The method begins by inputting a digital ink anno- 
tation and a digital document (box 600). The digital ink 
45 annotation then is classified, and an anchor context is 
identified (box 610). This classification is performed 
based on annotation characteristics and contextual in- 
formation in the digital document. Next, the annotation 
is robustly anchored to a region or position in the digital 
so document (box 620). This anchoring is based on the an- 
notation classification and the anchor context that was 
identified. Finally, the digital ink annotation is reflowed 
in the new layout of the digital document (box 630). This 
reflow process is based on the annotation anchor and 
55 the annotation classification. 
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V. Operational Details 

[0051] FIG. 7 is a detailed flow diagram illustrating the 
operation of the classification process for the digital ink 
annotation process shown in FIG. 6. The method begins 5 
by grouping the ink strokes on a digital document in all 
possible ways, pruning according to time and space 
proximity constraints, and evaluating each group for the 
likelihood that the combination of ink strokes it contains 
conforms to the characteristics of each annotation class 10 
(box 700). Then, the most likely anchor context is deter- 
mined for each grouping/classification combination (box 
71 0). A dynamic programming sub-problem solution ta- 
ble is filled-in with the likelihood of each grouping/clas- 
sification/context anchor combination (box 720). And fi- is 
nally, after all sub-problems (that were not pruned) have 
been evaluated, the sub-problem solution table is 
walked to determine the optimal total grouping of ink 
strokes which accounts for all strokes, does not include 
overlapping groups, and maximizes the combined prob- 20 
ability of all groups (box 730). 

[0052] When a user marks up the digital document, 
the process groups ink strokes to define what type of 
annotation the strokes comprise. For example, the an- 
notation might be an underline, circle, connector, margin 25 
bar, or margin comment. The grouping takes place at 
roughly the same level of abstraction and with the same 
accuracy as a human would group the ink strokes and 
define an annotation. 

[0053] Combining ink strokes into groups may be 30 
based on the temporal order in which the strokes were 
created, or the spatial arrangement of strokes, or a com- 
bination of both. This is another way of saying that the 
grouping and classification algorithm's dynamic pro- 
gramming search space is pruned based on temporal 35 
and spatial proximity constraints. Relying on temporal 
order is fast and easy. In addition, temporal grouping 
exploits the fact that the ink strokes used in most anno- 
tations are created sequentially. However, there may be 
times when strokes are created out of order. For exam- to 
pie, when a user goes back and dots an "i" or crosses 
a "t". In this situation, spatial arrangement becomes a 
better criteria for grouping. 

[0054] Digital ink annotations are ambiguous by na- 
ture and improper classification can yield confusing be- -*5 
havior. For example, if a horizontal arrow pointing to the 
text is erroneously classified as an underline, the arrow 
may be split across multiple lines when reflow occurs. 
Thus, in some embodiments completely automated 
grouping and classification may not be feasible. In these so 
embodiments, a hybrid approach may be used, such as 
requesting a user's help when the confidence estimate 
for a total grouping is below a certain threshold. 
[0055] FIG. 8A illustrates a user annotating a digital 
document by creating digital ink strokes on the docu- 55 
ment. In this case, the word "how?" has been written in 
the margin and the text referred to by the annotation has 
been circle. FIG. 8B illustrates the manual grouping and 
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classification of the annotation. FIG. 8C illustrates the 
reflowing of the document and annotation using the dig- 
ital ink annotation process (regardless of whether the 
annotation was automatically or manually grouped and 
classified). Note that the annotation in the reflowed doc- 
ument in FIG 8C "agrees" with the annotation in the orig- 
inal document of FIG. 8A because it has been moved - 
or reflowed - to keep up with where it belongs in the doc- 
ument. 

[0056] FIG. 9 is a detailed flow diagram illustrating the 
operation of the logical anchoring process for the digital 
ink annotation process shown in FIG. 6. Logical anchor- 
ing insures that when the document layout changes and 
its annotations are consequently reflowed, the original 
meaning and intent of the annotations are preserved. 
For example, if an annotation has been classified as an 
underline, then the annotation must remain under the 
range of text being underlined. This may require splitting 
or joining of ink strokes comprising the annotation de- 
pending on if the text range to which the annotation is 
anchored is reflowed. If, on the other hand, the annota- 
tion has been classified as an arrow connector, then it 
must remain pointing at what it was originally pointing 
to and from. The arrow connector, however, can be oth- 
erwise stretched, rotated and translated during reflow. 
[0057] After the ink strokes of the annotation are 
grouped and the annotation is classified, the classified 
annotation must be logically anchored to its intended 
content. In other words, the annotation must be an- 
chored to a region or position in the document where it 
belongs such that the meaning and intent of the anno- 
tation can be recovered even if the document's layout, 
format, or content changes. 

[0058] Annotations made with a pen on paper are af- 
fixed to a particular position on a paper document. How- 
ever, physical position in a digital document loses its 
meaning when the digital document is reflowed. Instead, 
the annotation must be anchored to its surrounding log- 
ical context (such as the range of text the annotation is 
near). This is challenging for two reasons. First, digital 
ink annotations often do not offer a strong indication of 
where they should be anchored. For instance, com- 
ments in the margin may pertain to a text range in the 
immediate vicinity or on the other side of the page. Sec- 
ond, the document may be edited between when it was 
originally annotated and when the annotation is dis- 
played. This makes it harder to recover the anchor con- 
text of the annotation. 

[0059] One way to achieve a robust anchoring is to 
employ sophisticated automatic analysis of document 
contents. These robust anchoring techniques are 
known in the art. For example, one robust anchoring 
technique that may be used with the digital ink annota- 
tion process is found in apaper by A.J. Brush, D. Barger- 
on, A. Gupta and J.J. Cadiz entitled "Robust Annotation 
Positioning in Digital Documents", in Proceedings of 
CHI 2001, pp. 285-292, ACM Press 2001. Automatic 
techniques, however, may still result in errors, and the 
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digital ink annotation process also includes a provision 
for asking a user to explicitly specify the anchor for a 
given annotation if it is determined that the annotation's 
anchor cannot be recovered from its logical anchor (if, 
for instance, the anchor context the logical anchor de- 
scribes has been removed from the document). 
[0060] Referring to FIG. 9, the anchoring method be- 
gins by extracting anchor features from the digital doc- 
ument that is being annotated (box 900). Next, anchor 
features are chosen for each group of ink strokes pro- 
duced by the grouping and classifying process in FIG 7 
based on the group's classification and anchor context, 
and according to a set of anchoring rules for each an- 
notation type (box 910). These per-annotation-type 
rules are discussed in more detail below. Finally, the ap- 
propriate anchor features are assigned to each ink 
stroke group to form fully reflowable annotations. After 
all annotations have been anchored to a document, the 
annotations are ready to be reflowed such that if the doc- 
ument layout changes, the annotations can keep up with 
where they belong in the document and can retain their 
visual meanings. 

[0061] FIG. 10 is a detailed flow diagram illustrating 
the operation of the reflow classification process for the 
digital inkannotation processshown in FIG. 6. If the orig- 
inal document changes, the reflow process rerenders 
annotations based on their classifications and their log- 
ical anchors to "keep up" with where they belong in the 
document. For example, if a sentence is underlined and 
the column in which the sentence resides is made thin- 
ner, the sentence may have to be reflowed to more lines 
than it originally occupied. In this case, the underline ink 
annotation must be divided and displayed on more lines 
than it was originally drawn. As another example, if a 
paragraph has been circled and the column in which it 
resides is made wider, then the circle inkannotation sur- 
rounding the paragraph must be stretched wide and 
made shorter. 

[0062] The reflow process begins by determining the 
new layout of a digital document (box 1000). Next, a dig- 
ital ink annotation is transformed to conform to the new 
layout (box 1 01 0). Finally, the newly transformed digital 
ink annotation is rerendered over the new document lay- 
out (box 1020). 

[0063] The digital ink annotation process automatical- 
ly recognizes the six most common types or classifica- 
tions of "active reading"-type digital ink annotations, and 
the system is extensible so that it can recognize more 
annotation types (such as editing marks) in the future. 
The types of annotations that are automatically recog- 
nized include underline annotations, highlight annota- 
tions, marginalia or "margin comment" annotations, cir- 
cle annotations, margin bracket annotations, and callout 
or "connector" annotations. The annotation characteris- 
tics, context, and anchoring rules for each of these an- 
notation classifications will now be discussed. 
[0064] In order to illustrate the principles described 
above, FIGS. 12A-G are offered as an example of how 



the digital ink annotation process can operate. In FIGS. 
12A-G, column "A" represents a user's annotation of a 
digital document using digital ink strokes; column "B" 
represents the annotation reflowed in a new document 
5 layout; and column "C" represents the "clean-up" anno- 
tation that has been stylized by the clean-up module. 

Underline Annotations 

w [0065] Referring to FIG. 11 A, underline annotations 
are composed of one or more approximately horizontal 
straight-line ink strokes. In the context of a digital doc- 
ument, underline annotations are typically found under 
text, especially one or more words, 
is [0066] Each ink stroke in an underline annotation is 
anchored separately to the range of text that it under- 
lines. The words within the range of text and their se- 
quence is known, and this information is used to control 
how the ink stroke is reflowed when and if the range of 
20 text is reflowed in the document. 

[0067] During reflow, if multiple strokes fall adjacent 
to one another on the same line of text, they will be fused 
together; and if one underline stroke happens to fall 
across two or more lines, it will be divided and its parts 
25 will be moved appropriately to remain under the origi- 
nally underlined context. 

Highlight Annotations 

30 [0068] Referring to FIG. 1 1 B, the unique characteris- 
tics of highlight annotations include the fact that they are 
like a strikethrough, except that the ink strokes compris- 
ing a highlight are thick and the ink is partially transpar- 
ent. Otherwise, highlight annotations are treated simi- 
35 larly to underlines: The context is determined in a similar 
manner to underline annotations, except that highlight 
ink strokes are approximately directly over the range of 
text to which they should be anchored (as opposed to 
lying under the line of text, as is the case with underline 
40 annotations). And the reflow rules for highlight annota- 
tions are also similar to underline annotations, except 
that during reflow the ink stroke is on the text range and 
the ink is wide and partially transparent. 



[0069] Referring to FIG. 11C, marginalia or margin 
comment annotations are defined as any digital ink 
stroke that is in the margin or white space of the digital 
document, but is not otherwise a connector. Marginalia 
includes drawings, symbols (such as "I" and "*"), and 
text within the margins or white space. 
[0070] The rule for anchoring marginalia annotations 
is that they are anchored to whatever portion of the doc- 
ument to which they are horizontally adjacent. Atop hor- 
izontal line, corresponding to the top of the marginalia 
annotation's bounding box, and a bottom horizontal line, 
corresponding to the bottom of the marginalia annota- 
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tion's bounding box, are projected across the document 
from wherever the marginalia annotation is positioned. 
The document content in between the top and bottom 
horizontal lines are considered marginalia annotation's 
anchor context. 

[0071] When the document's layout changes, margin- 
alia annotations are translated vertically such that they 
remain vertically centered next to their anchor contexts. 
For example, assume that a marginalia annotation in an 
original digital document is anchored to four lines of text. 
Next, assume that the layout of the digital document is 
modified such that the four lines of text becomes eight 
lines of text. In this case the anchored marginalia anno- 
tation is centered in the vertical direction, adjacent to 
the eight lines of text. 

Circle Annotations 

[0072] Referring to FIG. 11 D, circle annotations are 
divided into two types: "inline" circles and a "region" cir- 
cles. Inline circles typically circumscribe a single line of 
text, and region circles typically circumscribe multiple 
partial lines of text or other document content. A group 
of ink strokes may be recognized as some kind of circle 
if the strokes form a shape with no concavities, and 
groups of ink strokes that form circles may be differen- 
tiated into inline circles and region circles depending on 
their likely anchor context. For instance, if the circle en- 
closes one line of text, it can be labeled an Inline circle; 
while if the circle encloses text that falls on more than 
one line, it can be labeled a region circle. 
[0073] Circles of either type are anchored to the text 
and other document content they circumscribe. 
[0074] During reflow, the inline circle annotation type 
is typically translated horizontally to keep up with its an- 
chor context until its anchor context (e.g. the range of 
text it circumscribes) splits across two or more lines. 
When a split such as this occurs, the circle is copied, 
scaled, and translated such that the two or more por- 
tions of text comprising the original anchor context are 
individually circled. Referring to FIG. 11E, region circle 
annotations are scaled and translated to keep encircling 
their anchor contexts, but they are not copied. 

Margin Bracket Annotations 

[0075] Referring to FIG. 11 F, margin bracket annota- 
tions are similar to marginalia annotations. Groups of 
ink strokes are recognized as margin bracket annota- 
tions if they form approximately vertical lines in the doc- 
ument margins which are "close" to document content. 
[0076] Similarly to how marginalia annotations are an- 
chored, margin bracket annotations are anchored to 
whatever portion of the document to which they are hor- 
izontally adjacent. Top and bottom horizontal lines are 
generated, and the document content in between the 
top and bottom horizontal lines is considered the margin 
bracket annotation's anchor. 



[0077] Unlike marginalia annotations, when the text 
range to which a margin bracket annotation is anchored 
is reflowed, the marginal bracket annotation is translat- 
ed and scaled accordingly in the vertical direction, so 
5 that the bracket appears to stretch to keep bracketing 
its entire anchor context. 

Callout or Connector Annotations 

10 [0078] Referring to FIG. 11 G, callout or connector an- 
notations are annotations that link regions by going from 
one position to another position. One characteristic of 
the ink strokes comprising a callout or connector anno- 
tation is that they are not straight, but may have some 
is curvature. Another characteristic is that the callout or 
connector annotation is at an angle to the horizontal and 
vertical directions. Still another characteristic is that the 
ends of the callout or connector annotation are adjacent 
to text or content of the document, digital ink strokes 
20 from other annotations, or both. In other words, the call- 
out or connector annotations are connecting text or con- 
tent with an annotation or other text or content. 
[0079] The anchoring rules for callout or connector 
annotations are based on anchoring the callout's end- 
25 points. For each endpoint, the document content or oth- 
er freeform digital ink annotation closest to the endpoint 
is considered its anchor context. Thus, each callout has 
two separate and distinct anchor contexts. 
[0080] During reflow, the callout is translated, rotated, 
30 and scaled such that each endpoint remains constant 
relative to its anchor context. This can be seen in FIG 
11G. 

VI. Working Example 

35 

[0081] In order to more fully understand the digital ink 
annotation method and system disclosed herein, the de- 
tails of an exemplary working example are presented. It 
should be noted that this working example is only one 
40 way in which the digital ink annotation process and sys- 
tem disclosed herein may be implemented. 

Hardware 

45 [0082] For the working example, the digital ink anno- 
tation method was implemented to run on any Tablet PC 
hardware running Microsoft WindowsXP Tablet PC Edi- 
tion operating system. These kind of computers support 
a pen input digitizer integrated into their display screens, 

so thus allowing the computer to mimic the form factor of 
a writing pad. Users interact with the computer using a 
stylus, which enables a user to control the cursor and 
write directly on the screen. 

55 Software 

[0083] In this working example, the digital ink annota- 
tion process and system disclosed herein was embod- 
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ied as a plug-in for Microsoft Internet Explorer (IE). This 
plug-in (code-named "Callisto") supports an IE toolbar 
with pen and highlighter tools that allow a user to mark 
any part of any web page with digital ink. The ink strokes 
persist in a local cache on the user's machine. FIG. 12 
illustrates the toolbar of the digital ink annotation proc- 
ess and system that facilitates drawing and classifying 
digital annotations on a digital document. 

Grouping, Classifying and Anchoring 

[0084] In order to group and classify raw digital ink 
strokes into digital annotations, the digital ink annotation 
process and system embodied in Callisto allows a user 
to select automatic grouping and classification, or man- 
ual grouping and classification. If automatic grouping 
and classification is chosen, the Callisto software makes 
automatic decisions about grouping and classifying the 
user's ink strokes, and automatically determines anno- 
tation anchors, according to the process detailed in sec- 
tions III, IV, and V above. 

[0085] If manual grouping and classification is cho- 
sen, the user first selects the ink strokes he wishes to 
group together using the selection tool. Next, the user 
chooses a classification from the predetermined list of 
annotation types. Referring backto FIG. 9B, a user man- 
ually selects an annotation classification from a prede- 
termined list of annotations. This list of annotation types 
includes underlines, highlights, marginalia (such as 
margin comments and symbols), circles, margin bars, 
and connectors. Choosing one of these annotation 
types automatically anchors the selected group of 
strokes as an annotation to a context in the document. 
This anchoring is performed using the set of rules out- 
lined in Section V above. 

Reflowing Digital Annotations 

[0086] After grouping, classification and anchoring, 
wheneverthe IE document window is resized the digital 
ink annotation process and system automatically re- 
flows any annotations on the current page so that they 
keep up with their respective anchor contexts per the 
process outlined in Sections III, IV, and V above. 

Cleaning-Up Digital Annotations 

[0087] Besides reflowing the digital annotations to 
preserve the user's original drawing style, the digital ink 
annotation process and system embodied in the Callisto 
plug-in for I E can also draw formalized "cleaned-up" ver- 
sions of digital annotations per the process outlined in 
Section III above. In this working example, the digital ink 
annotation process and system cleaned-up the annota- 
tions as follows: The digital ink annotation process and 
system rendered underline annotations as straight line 
segments, highlight annotations as translucent horizon- 
tal rectangles over the anchor text, circles as round-cor- 



nered rectangles, margin bars as simple Bezier curves 
spanning the height of the anchor text, and connectors 
are straight lines running between their endpoint anchor 
contexts. Margin comments and symbols were not 
5 parsed and were left as the user drew them, since they 
typically contain so much variance and individual style 
it is very difficult to transform them into an acceptable 
formalized style. 



[0088] The digital ink annotation process and system 
disclosed herein is designed to operate in a computing 
environment. The following discussion is intended to 
provide a brief, general description of a suitable com- 
puting environment in which the digital ink annotation 
process and system may be implemented. 
[0089] FIG. 13 illustrates an example of a suitable 
computing system environment 1 300 in which the digital 
ink annotation process and system may be implement- 
ed. The computing system environment 1300 is only 
one example of a suitable computing environment and 
is not intended to suggest any limitation as to the scope 
of use or functionality of the invention. Neither should 
the computing environment 1 300 be interpreted as hav- 
ing any dependency or requirement relating to any one 
or combination of components illustrated in the exem- 
plary operating environment 1300. 
[0090] The digital ink annotation process and system 
is operational with numerous other general purpose or 
special purpose computing system environments or 
configurations. Examples of well known computing sys- 
tems, environments, and/or configurations that may be 
suitable for use with the digital ink annotation process 
and system include, but are not limited to, personal com- 
puters, server computers, hand-held, laptop or mobile 
computer or communications devices such as cell 
phones and PDA's, multiprocessor systems, microproc- 
essor-based systems, set top boxes, programmable 
consumer electronics, network PCs, minicomputers, 
mainframe computers, distributed computing environ- 
ments that include any of the above systems or devices, 
and the like. 

[0091] The digital ink annotation process and system 
may be described in the general context of computer- 
executable instructions, such as program modules, be- 
ing executed by a computer. Generally, program mod- 
ules include routines, programs, objects, components, 
data structures, etc., that perform particular tasks or im- 
plement particular abstract data types. The digital ink 
annotation process and system may also be practiced 
in distributed computing environments where tasks are 
performed by remote processing devices that are linked 
through a communications network. In a distributed 
computing environment, program modules may be lo- 
cated in both local and remote computer storage media 
including memory storage devices. With reference to 
FIG. 13, an exemplary system for implementing the dig- 
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ital ink annotation process and system includes a gen- 
eral-purpose computing device in the form of a compu- 
ter 1 31 0 (the computer 1 31 0 is an example of the com- 
puting device 1 40 shown in FIG. 1 ). 
[0092] Components of the computer 1310 may in- 
clude, but are not limited to, a processing unit 1320, a 
system memory 1 330, and a system bus 1 321 that cou- 
ples various system components including the system 
memory to the processing unit 1320. The system bus 
1321 may be any of several types of bus structures in- 
cluding a memory bus or memory controller, a peripheral 
bus, and a local bus using any of a variety of bus archi- 
tectures. By way of example, and not limitation, such 
architectures include Industry Standard Architecture 
(ISA) bus, Micro Channel Architecture (MCA) bus, En- 
hanced ISA (EISA) bus, Video Electronics Standards 
Association (VESA) local bus, and Peripheral Compo- 
nent Interconnect (PCI) bus also known as Mezzanine 
bus. 

[0093] The computer 1310 typically includes a variety 
of computer readable media. Computer readable media 
can be any available media that can be accessed by the 
computer 1310 and includes both volatile and nonvola- 
tile media, removable and non-removable media. By 
way of example, and not limitation, computer readable 
media may comprise computer storage media and com- 
munication media. Computer storage media includes 
volatile and nonvolatile removable and non-removable 
media implemented in any method or technology for 
storage of information such as computer readable in- 
structions, data structures, program modules or other 
data. 

[0094] Computer storage media includes, but is not 
limited to, RAM, ROM, EEPROM, flash memory or other 
memory technology, CD-ROM, digital versatile disks 
(DVD) or other optical disk storage, magnetic cassettes, 
magnetic tape, magnetic disk storage or other magnetic 
storage devices, or any other medium which can be 
used to store the desired information and which can be 
accessed by the computer 1310. Communication media 
typically embodies computer readable instructions, data 
structures, program modules or other data in a modu- 
lated data signal such as a carrier wave or other trans- 
port mechanism and includes any information delivery 
media. 

[0095] Note that the term "modulated data signal" 
means a signal that has one or more of its characteris- 
tics set or changed in such a manner as to encode in- 
formation in the signal. By way of example, and not lim- 
itation, communication media includes wired media 
such as a wired network or direct-wired connection, and 
wireless media such as acoustic, RF, infrared and other 
wireless media. Combinations of any of the above 
should also be included within the scope of computer 
readable media. 

[0096] The system memory 1330 includes computer 
storage media in the form of volatile and/or nonvolatile 
memory such as read only memory (ROM) 1331 and 
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random access memory (RAM) 1 332. A basic input/out- 
put system 1333 (BIOS), containing the basic routines 
that help to transfer information between elements with- 
in the computer 1310, such as during start-up, is typi- 
5 cally stored in ROM 1331. RAM 1332 typically contains 
data and/or program modules that are immediately ac- 
cessible to and/or presently being operated on by 
processing unit 1 320. By way of example, and not limi- 
tation, FIG. 13 illustrates operating system 1334, appli- 
cation programs 1335, other program modules 1336, 
and program data 1337. 

[0097] The computer 1310 may also include other re- 
movable/non-removable, volatile/nonvolatile computer 
storage media. By way of example only FIG. 13 illus- 
trates a hard disk drive 1341 that reads from or writes 
to non-removable, nonvolatile magnetic media, a mag- 
netic disk drive 1351 that reads from or writes to a re- 
movable, nonvolatile magnetic disk 1 352, and an optical 
disk drive 1 355 that reads from or writes to a removable, 
nonvolatile optical disk 1356 such as a CD ROM or other 
optical media. 

[0098] Other removable/non-removable, volatile/non- 
volatile computer storage media that can be used in the 
exemplary operating environment include, but are not 
limited to, magnetic tape cassettes, flash memory cards, 
digital versatile disks, digital video tape, solid state 
RAM, solid state ROM, and the like. The hard disk drive 
1341 is typically connected to the system bus 1321 
through a non-removable memory interface such as in- 
terface 1340, and magnetic disk drive 1351 and optical 
disk drive 1355 are typically connected to the system 
bus 13221 by a removable memory interface, such as 
interface 1350. 

[0099] The drives and their associated computer stor- 
age media discussed above and illustrated in FIG. 13, 
provide storage of computer readable instructions, data 
structures, program modules and other data for the 
computer 1310. In FIG. 13, for example, hard disk drive 
1341 is illustrated as storing operating system 1 344, ap- 
plication programs 1345, other program modules 1346, 
and program data 1347. Note that these components 
can either be the same as or different from operating 
system 1334. application programs 1335, other pro- 
gram modules 1 336. and program data 1 337. Operating 
system 1344. application programs 1345, other pro- 
gram modules 1346, and program data 1347 are given 
different numbers here to illustrate that, at a minimum, 
they are different copies. A user may enter commands 
and information into the computer 1310 through input 
devices such as a keyboard 1362 and pointing device 
1361, commonly referred to as a mouse, trackball or 
touch pad. 

[0100] Other input devices (not shown) may include 
a microphone, joystick, game pad, satellite dish, scan- 
ner, radio receiver, or a television or broadcast video re- 
ceiver, or the like. These and other input devices are 
often connected to the processing unit 1320 through a 
user input interface 1360 that is coupled to the system 
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bus 1 321 , but may be connected by other interface and 
bus structures, such as, for example, a parallel port, 
game port or a universal serial bus (USB). A monitor 
1391 or other type of display device is also connected 
to the system bus 1 321 via an interface, such as a video 
interface 1390. In addition to the monitor, computers 
may also include other peripheral output devices such 
as speakers 1397 and printer 1396, which may be con- 
nected through an output peripheral interface 1395. 
[0101] The computer 1310 may operate in a net- 
worked environment using logical connections to one or 
more remote computers, such as a remote computer 
1380. The remote computer 1380 may be a personal 
computer, a server, a router, a network PC, a peer de- 
vice or other common network node, and typically in- 
cludes many or all of the elements described above rel- 
ative to the computer 1310, although only a memory 
storage device 1 381 has been illustrated in FIG. 1 3. The 
logical connections depicted in FIG. 13 include a local 
area network (LAN) 1371 and a wide area network 
(WAN) 1 373, but may also include other networks. Such 
networking environments are commonplace in offices, 
enterprise-wide computer networks, intranets and the 
Internet. 

[0102] When used in a LAN networking environment, 
the computer 1310 is connected to the LAN 1371 
through a network interface or adapter 1370. When 
used in a WAN networking environment, the computer 
1310 typically includes a modem 1372 or other means 
for establishing communications over the WAN 1373, 
such as the Internet. The modem 1372, which may be 
internal or external, may be connected to the system bus 
1321 via the user input interface 1360, or other appro- 
priate mechanism. In a networked environment, pro- 
gram modules depicted relative to the computer 1310, 
or portions thereof, may be stored in the remote memory 
storage device. By way of example, and not limitation, 
FIG. 13 illustrates remote application programs 1385 as 
residing on memory device 1 381 . It will be appreciated 
that the network connections shown are exemplary and 
other means of establishing a communications link be- 
tween the computers may be used. 
[0103] The foregoing description of the invention has 
been presented for the purposes of illustration and de- 
scription. It is not intended to be exhaustive or to limit 
the invention to the precise form disclosed. Many mod- 
ifications and variations are possible in light of the above 
teaching. It is intended that the scope of the invention 
be limited not by this detailed description of the inven- 
tion, but rather by the claims appended hereto. 



Claims 

1. A method for processing a digital document con- 
taining a digital ink annotation, comprising: 

classifying thedigital inkannotationtogenerate 



26 

an annotation classification; 
anchoring the digital ink annotation to a region 
in the digital document with which the digital ink 
annotation is associated to generate an anno- 
5 tation anchor; and 

reflowing the digital ink annotation in a new lay- 
out of the digital document. 

2. The method as set forth in claim 1 , wherein the dig- 
io ital ink annotation is comprised of digital ink strokes 

and further comprising: 

grouping the digital inkstrokes created by aus- 
er together to define the digital ink annotation; 
'5 and 

classifying the digital ink annotation based on 
the grouped ink strokes. 

3. The method as set forth in claim 1 , further compris- 
20 ing classifying the digital ink annotation based on at 

least two different criteria. 

4. The method as set forth in claim 3, wherein the at 
least two criteria include: (a) annotation character- 
's istics; and (b) contextual information in the digital 

document. 

5. The method as set forth in claim 3, further compris- 
ing defining a confidence estimate of the annotation 

30 classification that determines whether the user 
should be consulted. 

6. The method as set forth in claim 5, further compris- 
ing determining that the confidence estimate is low 

35 for the annotation classification. 

7. The method as set forth in claim 5, further compris- 
ing determining that the confidence estimate for a 
grouping of digital inkstrokes falls below a threshold 

io and that manual grouping should be used. 

8. The method as set forth in claim 1 , further compris- 
ing defining anchoring rules for each classification 
of digital ink annotations. 

45 

9. The method as set forth in claim 1 , further compris- 
ing applying anchoring rules for each classification 
of digital ink annotation to generate the annotation 
anchor. 

10. The method as set forth in claim 1 , further compris- 
ing determining the validity of the annotation anchor 
based on the presence or absence of the anchor 
context for the digital ink annotation in the digital 

55 document. 

11. The method as set forth in claim 10, further com- 
prising determining whether a user interaction is de- 
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sired based on the validity of the annotation anchor. 

12. The method as set forth in claim 1 , further compris- 
ing processing the digital ink annotation to conform 
to the new layout such that the meaning and intent 
of the original digital ink annotation is preserved. 

13. The method as set forth in claim 12, further com- 
prising rendering the digital ink annotation on a dis- 
play device to conform to the new layout. 

14. A computer-readable medium having computer-ex- 
ecutable instructions for performing the method re- 
cited in claim 1. 

15. A method for processing digital ink strokes made in 
a digital document, comprising: 

grouping the digital ink strokes to define a dig- 
ital ink annotation; 

classifying the digital ink annotation based on 
annotation characteristics and contextual infor- 
mation in the digital document; and 
reflowing the digital ink annotation a modified 
version of the digital document having a new 
layout such that an intent and meaning of the 
digital ink strokes are preserved. 

16. The method as set forth in claim 15, further com- 
prising grouping the digital ink strokes based on a 
temporal ordering of the digital ink strokes. 

17. The method as set forth in claim 15, further com- 
prising grouping the digital ink strokes based on a 
spatial ordering of the digital ink strokes. 

18. The process as set forth in claim 15, further com- 
prising grouping the digital ink strokes based on a 
combination of: (a) a temporal ordering of the digital 
ink strokes; and (b) a spatial ordering of the digital 
ink strokes. 

19. One or more computer-readable media having 
computer-readable instructions thereon which, 
when executed by one or more processors, cause 
the one or more processors to implement the meth- 
od of claim 15. 

20. A computer-readable medium having computer-ex- 
ecutable instructions for adapting a digital ink anno- 
tation in adigital document to a new layout in a mod- 
ified digital document, comprising: 

grouping digital ink strokes within the digital 
document to define the digital ink annotation; 
anchoring the digital ink annotation based on 
the annotation classification to generate a dig- 
ital documentwith properly flowed annotations; 
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subsequently modifying the digital document 
with properly flowed annotations by an external 
modification process to generate the modified 
digital document having the new layout that is 
5 different from the digital document with properly 

flowed annotations; and 

reflowing the digital annotations to conform to 
the new layout of the modified digital document. 

10 21. The computer-readable medium of claim 21, further 
comprising grouping the digital ink strokes based 
on at least one of: (a) a temporal ordering of the 
digital ink strokes; (b) a spatial ordering of the digital 
ink strokes. 

75 

22. The computer-readable medium of claim 21 , where- 
in the digital ink annotations include active reading 
annotations that are made by a user to emphasize 
certain content, make comments, or record 

20 thoughts and impressions within the digital docu- 
ment. 

23. The computer-readable medium of claim 21 , where- 
in the digital ink annotation includes editing anno- 
ys tations that are used to indicate errors in the digital 

document. 

24. A process for reflowing digital ink strokes from an 
original digital document having a first layout in a 

30 modified digital document having a second layout 
that is different from the first layout, comprising: 

grouping the digital ink strokes to define a dig- 
ital ink annotation; 

35 classifying the digital ink annotation based on 

annotation characteristics and contextual infor- 
mation within the original digital document; 
anchoring the digital ink annotation to a region 
in the original digital document associated with 

40 the digital ink annotation; 

determining the second layout of the modified 
digital document; 

processing the digital ink annotation to conform 
to the second layout; and 

45 rendering the digital ink annotation in the mod- 

ified digital document such that the digital ink 
annotation conforms to the second layout and 
retains a same meaning and intent in the mod- 
ified digital document as was present in the 

so original digital document.. 

25. The process as set forth in claim 25, further com- 
prising classifying the digital ink annotation as an 
underline annotation. 

55 

26. The process as set forth in claim 26, further com- 
prising anchoring the underline annotation to the 
characters in the region that are underlined. 
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27. The process as set forth in claim 25, further com- 
prising classifying the digital ink annotation as a 
highlight annotation. 

28. The process as set forth in claim 25, further com- 5 
prising classifying the digital ink annotation as a 
marginalia annotation. 

29. The process as set forth in claim 29, wherein the 
marginalia annotation is defined as any digital ink 10 
annotation stroke that is in a margin or white space 

of the digital document. 

30. The process as set forth in claim 25, further com- 
prising classifying the digital ink annotation as a cir- is 
cle annotation. 

31. The process as set forth in claim 31, wherein the 
circle annotation is at least one of: (a) an inline cir- 
cle; (b) a region circle. 20 

32. The process as set forth in claim 32, wherein the 
inline circle circumscribes a single line of text. 

33. The process as set forth in claim 32, wherein the 25 
region circle circumscribes multiple partial line of 
text and other digital document content. 

34. The process as set forth in claim 25, further com- 
prising classifying the digital ink annotation as a 30 
margin bracket annotation. 

35. The process as set forth in claim 35, wherein the 
margin bracket annotation is defined as groups of 

ink strokes that form approximately vertical lines a 35 
margin of the digital document. 

36. The process as set forth in claim 25, further com- 
prising classifying the digital ink annotation as a 
callout annotation. 40 

37. The process as set forth in claim 37, wherein the 
callout annotation is defined as an annotation that 
groups links regions by of the digital document by 
going from one position to another position. 4 s 

38. The process as set forth in claim 37, wherein the 
callout annotation exhibits some curvature. 

39. The process as set forth in claim 37, wherein the so 
callout annotation is at an angle to horizontal and 

to vertical directions in the digital document. 

40. The process as set forth in claim 37, wherein ends 

of the callout annotation are adjacent to at least one 55 
of: (a) text of the digital document; (b) content of the 
digital document; (c) digital ink strokes from other 
annotations. 
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